Entity Ambiguity Resolution with Topic Models
نویسندگان
چکیده
We address the problem of ambiguity resolution in associating facts to the entities with multiple senses. A semi-supervised graphical model assigns the best categories for entities given the context. Problem Statement Analyzing the results of current algorithms for extracting facts about entities [1] shows that despite the mutual exclusion constraints that are enforced, there is still a need for resolving ambiguity and determining which set of facts belong to what sense of the entity. For example, for the entity “Tigers” we have the categories “Mammal” and “Sports Team”. While there is a mutex rule in the seed ontology for these two categories, the entity contains extracted fact for “teamPlaysAgainstTeam” which only applies to “Sports Team” sense of this word. In this project, we will add the support for multiple senses of the entity and also define the learning method that can recognize when ambiguity exists and also the ability to associate facts with the correct sense of the entity. This work can also potentially allow us to infer the mutex rules from the data.
منابع مشابه
Automatic Term Ambiguity Detection
While the resolution of term ambiguity is important for information extraction (IE) systems, the cost of resolving each instance of an entity can be prohibitively expensive on large datasets. To combat this, this work looks at ambiguity detection at the term, rather than the instance, level. By making a judgment about the general ambiguity of a term, a system is able to handle ambiguous and una...
متن کاملRelative clause attachment ambiguity resolution in Persian
The present study seeks to find the way Persian native speakers resolve relative clause attachment ambiguities in sentences containing a complex NP of the type NP of NP followed by a relative clause (RC). Previous off-line studies have found a preference for high attachment in the present study, an on-line technique was used to help identify the nature of this process. Persian speakers were pre...
متن کاملComputational Complexity of Probabilistic Disambiguation
Recent models of natural language processing employ statistical reasoning for dealing with the ambiguity of formal grammars. In this approach, statistics, concerning the various linguistic phenomena of interest, are gathered from actual linguistic data and used to estimate the probabilities of the various entities that are generated by a given grammar, e.g., derivations, parse-trees and sentenc...
متن کاملEvaluation of the Regularization Algorithm to Decorrelation of Covariance Matrix of Float Ambiguity in Fast Resolution of GPS Ambiguity Parameters
Precise positioning in Real Time Kinematic (RTK) applications depends on the accurate resolution of the phase ambiguities. In RTK positioning, ambiguity parameters are highly correlated, especially when the positioning rate is high. Consequently, application of de-correlation techniques for the accurate resolution of ambiguities is inevitable. Phase ambiguity as positioning observations by the ...
متن کاملGeneric Statistical Relational Entity Resolution in Knowledge Graphs
Entity resolution, the problem of identifying the underlying entity of references found in data, has been researched for many decades in many communities. A common theme in this research has been the importance of incorporating relational features into the resolution process. Relational entity resolution is particularly important in knowledge graphs (KGs), which have a regular structure capturi...
متن کامل